Inflammatory bowel disease (IBD), particularly ulcerative colitis (UC), is graded by endoscopists, and this assessment is the basis for risk stratification and therapy monitoring. At present, endoscopic characterization is largely operator-dependent, leading to sometimes poor clinical outcomes for IBD patients. We focus on the widely used Mayo Endoscopic Score (MES) system, which requires the reliable identification of changes in mucosal inflammation. Most existing deep learning classification methods cannot detect these fine-grained changes, which makes UC grading a challenging task. In this work, we introduce a novel patch-level instance-group discrimination with pretext-invariant representation learning (PLD-PIRL) for self-supervised learning (SSL). Our experiments demonstrate improved accuracy and robustness compared to a baseline supervised network and several state-of-the-art SSL methods. Compared to the baseline (ResNet50) supervised classification, our proposed PLD-PIRL achieves an improvement in top-1 accuracy of 4.75% on hold-out test data and 6.64% on unseen center test data.
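The patch-level contrastive objective behind methods of this kind can be illustrated with an InfoNCE-style loss: the embedding of a patch should stay close to the embedding of its transformed ("pretext-invariant") view while being pushed away from other instances. This is a minimal, dependency-free sketch of the general loss family, not the paper's exact PLD-PIRL formulation; the embeddings and temperature below are illustrative.

```python
import math

def normalize(v):
    """L2-normalize an embedding vector."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def patch_info_nce(anchor, positive, negatives, temperature=0.07):
    """InfoNCE-style contrastive loss: the anchor embedding (e.g. of an
    image patch) should score higher with its transformed view (the
    positive) than with embeddings of other instances (the negatives)."""
    a = normalize(anchor)
    logits = [dot(a, normalize(positive)) / temperature]
    logits += [dot(a, normalize(n)) / temperature for n in negatives]
    # negative log-softmax probability of the positive (index 0),
    # computed with the max-subtraction trick for numerical stability
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[0]
```

A perfectly matched positive drives the loss toward zero, while a positive that is no more similar than the negatives leaves a loss near log(K+1) for K negatives.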
Colonic polyps are well-known precursors of cancer, identified through colonoscopy performed for the diagnostic work-up of symptoms, colorectal cancer screening, or systematic surveillance of certain diseases. While most polyps are benign, the number, size, and surface structure of polyps are closely linked to the risk of colon cancer. There is a high miss rate and incomplete removal of colonic polyps due to their variable nature, the difficulty of delineating abnormalities, high recurrence rates, and the anatomical topography of the colon. In the past, several methods have been built to automate polyp detection and segmentation. However, the key issue with most methods is that they have not been rigorously tested on large, multicenter, purpose-built datasets. Thus, these methods may not generalize to datasets from different populations, because they overfit to a specific population and endoscopic surveillance setting. In this sense, we have curated a dataset from six different centers incorporating more than 300 patients. The dataset includes both single-frame and sequence data, with 3446 annotated polyp labels whose precise delineation of polyp boundaries was verified by six senior gastroenterologists. To our knowledge, this is the most comprehensive detection and pixel-level segmentation dataset curated by a team of computational scientists and expert gastroenterologists. This dataset was part of the EndoCV2021 challenge aimed at addressing generalizability in polyp detection and segmentation. In this paper, we provide comprehensive insight into the data construction and annotation strategies, annotation quality assurance, and technical validation of our extended EndoCV2021 dataset, which we refer to as PolypGen.
There has been great recent advancement in human-computer chat. However, proper evaluation currently requires human judgements that produce notoriously high-variance metrics due to their inherent subjectivity. Furthermore, there is little standardization in the methods and labels used for evaluation, with an overall lack of work to compare and assess the validity of various evaluation approaches. As a consequence, existing evaluation results likely leave an incomplete picture of the strengths and weaknesses of open-domain chatbots. We aim towards a dimensional evaluation of human-computer chat that can reliably measure several distinct aspects of chat quality. To this end, we present our novel human evaluation method that quantifies the rate of several quality-related chatbot behaviors. Our results demonstrate our method to be more suitable for dimensional chat evaluation than alternative Likert-style or comparative methods. We then use our validated method and existing methods to evaluate four open-domain chat models from the recent literature.
Developments in autonomous vehicles (AVs) are advancing rapidly, and over the next 20 years AVs will become a central part of our society. However, especially in the early stages of deployment, incidents involving AVs are to be expected. In the event of an AV incident, decisions with ethical dimensions will need to be made, e.g., deciding between colliding with a group of pedestrians or with a rigid barrier. For an AV to undertake such ethical decision making and path planning, simulation models of the situation will be required that are used in real time on board the AV. These models will enable path planning and ethical decision making to be undertaken based on predetermined collision injury severity levels. In this research, models are developed for path planning and ethical decision making that predetermine knowledge regarding the possible collision injury severities, i.e., the peak deformation of the AV colliding into the rigid barrier or the impact velocity of the AV colliding into a pedestrian. Based on such knowledge and using fuzzy logic, a novel nonlinear weighted utility cost function for the collision injury severity levels is developed. This allows the model-based predicted collision outcomes arising from AV peak deformation and AV-pedestrian impact velocity to be examined separately via weighted utility cost functions with a common structure. The general form of the weighted utility cost function exploits a fuzzy sets approach, thus allowing common utility costs from the two separate utility cost functions to be meaningfully compared. A decision-making algorithm, which makes use of a utilitarian ethical approach, ensures that the AV will always steer onto the path which represents the lowest injury severity level, hence the lowest utility cost to society.
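The shared structure described above, fuzzy injury-severity levels whose memberships weight a nonlinear cost, can be sketched as follows. This is an illustrative construction, not the paper's actual function; the fuzzy set parameters and weights below are assumptions chosen only to show how costs from different severity metrics (peak deformation, impact velocity) become comparable under a common form.

```python
def triangular_membership(x, a, b, c):
    """Degree to which x belongs to a triangular fuzzy set (a, b, c)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

def utility_cost(severity_metric, fuzzy_sets, weights):
    """Weighted utility cost with a common structure: each fuzzy
    injury-severity level contributes its membership degree times a
    (nonlinearly increasing) weight; the weighted average is the cost."""
    memberships = [triangular_membership(severity_metric, *s) for s in fuzzy_sets]
    den = sum(memberships)
    if den == 0.0:
        return 0.0
    return sum(w * m for w, m in zip(weights, memberships)) / den

# Illustrative severity levels for peak deformation (normalized 0..1):
# "low", "medium", "high", with superlinear weights so severe outcomes
# dominate the comparison between candidate paths.
deformation_sets = [(-0.1, 0.0, 0.4), (0.2, 0.5, 0.8), (0.6, 1.0, 1.1)]
severity_weights = [1.0, 4.0, 9.0]
```

A decision algorithm in the utilitarian spirit of the abstract would evaluate this cost for each candidate path (using the metric appropriate to that path's predicted collision type) and steer onto the path with the lowest cost.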
As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems-level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently-developed LL components can be integrated into a single realized system. We also introduce an evaluation environment in order to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment.
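The systems-level idea of assimilating independently developed lifelong-learning components behind one API can be sketched as below. This is a hypothetical interface in the spirit of what the abstract describes, not the actual L2RLCF API; the lifecycle hook names and the replay-buffer component are assumptions for illustration.

```python
from abc import ABC, abstractmethod

class LLComponent(ABC):
    """Hypothetical component interface: each lifelong-learning component
    hooks into shared lifecycle events of the learner, so independently
    developed components can be composed into one system."""

    @abstractmethod
    def on_task_start(self, task_id): ...

    @abstractmethod
    def on_step(self, transition): ...

    @abstractmethod
    def on_task_end(self, task_id): ...

class ReplayBuffer(LLComponent):
    """Example component: retains transitions across tasks, one common
    ingredient for mitigating catastrophic forgetting."""
    def __init__(self, capacity=10000):
        self.buffer, self.capacity = [], capacity
    def on_task_start(self, task_id):
        pass
    def on_step(self, transition):
        if len(self.buffer) >= self.capacity:
            self.buffer.pop(0)  # drop the oldest transition
        self.buffer.append(transition)
    def on_task_end(self, task_id):
        pass

class LifelongSystem:
    """Dispatches lifecycle events to every registered component, so a
    scenario (sequence of tasks) exercises all components uniformly."""
    def __init__(self):
        self.components = []
    def register(self, component):
        self.components.append(component)
    def run_task(self, task_id, transitions):
        for c in self.components:
            c.on_task_start(task_id)
        for t in transitions:
            for c in self.components:
                c.on_step(t)
        for c in self.components:
            c.on_task_end(task_id)
```

The point of such a design is that adding a new component (e.g. a task detector or a regularizer) requires implementing the interface, not modifying the system.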
In recent years, deep learning has infiltrated every field it has touched, reducing the need for specialist knowledge and automating the process of knowledge discovery from data. This review argues that astronomy is no different, and that we are currently in the midst of a deep learning revolution that is transforming the way we do astronomy. We trace the history of astronomical connectionism from the early days of multilayer perceptrons, through the second wave of convolutional and recurrent neural networks, to the current third wave of self-supervised and unsupervised deep learning. We then predict that we will soon enter a fourth wave of astronomical connectionism, in which finetuned versions of an all-encompassing 'foundation' model will replace expertly crafted deep learning models. We argue that such a model can only be brought about through a symbiotic relationship between astronomy and connectionism, whereby astronomy provides high quality multimodal data to train the foundation model, and in turn the foundation model is used to advance astronomical research.
Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide semantically computable representations of biological knowledge and enable the integration of multiple types of biomedical data. However, mapping EHR data to OBO Foundry ontologies requires significant manual curation and domain expertise. We introduce a framework for mapping Observational Medical Outcomes Partnership (OMOP) standard vocabulary concepts to OBO Foundry ontologies. Using this framework, we produced mappings for 92,367 conditions, 8,615 drug ingredients, and 10,673 measurement results. Domain experts validated the mapping accuracy, and when examined across 24 hospitals, the mappings covered 99% of conditions and drug ingredients and 68% of measurements. Finally, we demonstrate that the OMOP2OBO mappings can aid in the systematic identification of undiagnosed rare disease patients who might benefit from genetic testing.
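The coverage statistic reported above (the fraction of concepts observed in a hospital's EHR data that carry at least one ontology mapping) is straightforward to compute once the mappings exist. A minimal sketch, with placeholder concept identifiers and CURIEs that are illustrative, not actual OMOP2OBO entries:

```python
def mapping_coverage(mappings, observed_concepts):
    """Fraction of EHR concepts observed at a site that have at least one
    OBO Foundry mapping. `mappings` maps a concept key to a (possibly
    empty) list of ontology CURIEs."""
    if not observed_concepts:
        return 0.0
    mapped = sum(1 for c in observed_concepts if mappings.get(c))
    return mapped / len(observed_concepts)

# Placeholder mapping table (hypothetical IDs, for illustration only)
example_mappings = {
    "condition:1001": ["MONDO:0000001"],
    "drug:2002": ["CHEBI:0000002"],
    "measurement:3003": [],  # concept with no mapping yet
}
```

Computed per domain (conditions, drug ingredients, measurements) across each site's observed concepts, this yields the kind of per-domain coverage figures the abstract reports.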
Predicting the short-term dynamics of the economy, an important input to the decision-making processes of economic agents, typically relies on lagged indicators in linear models. This is generally sufficient during normal times but can be inadequate during crisis periods. This paper aims to demonstrate that, with the help of nonlinear machine learning approaches, nontraditional and timely data such as retail and wholesale payments can give policymakers sophisticated models with which to accurately estimate key macroeconomic indicators in near real time. Moreover, we provide a set of econometric tools to mitigate the overfitting and interpretability challenges of machine learning models, improving their effectiveness for policy use. Our models, featuring payments data, nonlinear methods, and tailored cross-validation approaches, help improve macroeconomic nowcasting accuracy by up to 40%, with the gains being larger during the COVID-19 period. We observe that the contribution of payments data to economic nowcasting is small and linear during periods of low and normal growth. However, during periods of strong negative or positive growth, the contribution of payments data is large, asymmetric, and nonlinear.
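One standard way to tailor cross-validation for nowcasting, so the model is never validated on data that precedes its training window, is an expanding-window scheme. This is a generic sketch of that idea, not the paper's specific procedure; the fold layout is an assumption for illustration.

```python
def expanding_window_splits(n_samples, n_folds, min_train):
    """Time-ordered cross-validation splits: each fold trains on all
    observations up to a cutoff and validates on the next block, so the
    model never sees the future. This respects the temporal structure of
    macroeconomic data and helps curb overfitting."""
    fold_size = (n_samples - min_train) // n_folds
    splits = []
    for k in range(n_folds):
        train_end = min_train + k * fold_size
        val_end = min(train_end + fold_size, n_samples)
        splits.append((list(range(train_end)),           # expanding train set
                       list(range(train_end, val_end)))) # next block validates
    return splits
```

Each successive fold's training set strictly contains the previous one, mimicking how a nowcasting model would be re-estimated as new payments data arrive.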
Online autonomous agents are able to draw on a wide variety of potential sources of task knowledge; however, current approaches invariably focus on only one or two. Here, we investigate the challenges and impact of exploiting diverse knowledge sources for one-shot learning of new tasks by a simulated household mobile robot. The resulting agent, developed in the Soar cognitive architecture, uses the following sources of domain and task knowledge: interaction with the environment, task execution and planning knowledge, human natural language instruction, and responses retrieved from a large language model (GPT-3). We explore the distinct contributions of these knowledge sources and evaluate the performance of different combinations in terms of learning correct task knowledge, human workload, and computational cost. The results of combining all sources demonstrate that integration can improve one-shot task learning in terms of both computational cost and human workload.
Within the long-term goal of reverse-engineering a computational brain from the bottom up, the focus of this document is the macrocolumn abstraction layer. The basic macro-architecture is developed by first describing its operation with a state machine model. The state machine functions are then implemented with spiking neurons that support temporal computation. The neuron model is based on active spiking dendrites and reflects the Hawkins/Numenta neuron model. The architecture is demonstrated via a research benchmark in which an agent uses a macrocolumn to first learn, and then navigate, a 2-D environment containing randomly placed features. The environment is represented in the macrocolumn as a labeled directed graph, in which edges connect features and labels indicate the relative displacements between them.
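The labeled directed graph described above, with features as nodes and relative displacements as edge labels, can be sketched abstractly as follows. This models only the graph representation, not the spiking-neuron implementation; the class and method names are illustrative assumptions.

```python
class MacrocolumnGraph:
    """Labeled directed graph in the spirit of the described macrocolumn
    representation: nodes are environment features, and each edge label
    is the relative (dx, dy) displacement from one feature to another."""

    def __init__(self):
        # feature -> list of (displacement, neighboring feature)
        self.edges = {}

    def add_observation(self, feat_a, feat_b, displacement):
        """Learn that moving by `displacement` from feat_a reaches feat_b."""
        self.edges.setdefault(feat_a, []).append((displacement, feat_b))

    def predict_feature(self, current_feature, movement):
        """Return the feature expected after `movement` from
        `current_feature`, if that displacement has been learned."""
        for disp, feat in self.edges.get(current_feature, []):
            if disp == movement:
                return feat
        return None
```

After a learning phase populates the graph, navigation amounts to querying predicted features along candidate movements, which is the role the benchmark assigns to the macrocolumn.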